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We report on a search for the standard model Higgs boson produced in association with a W or 
Z boson in pp collisions at y's = 1.96 TcV recorded by the CDF II experiment at the Tevatron in a 
data sample corresponding to an integrated luminosity of 2.1 fb _1 . We consider events which have 
no identified charged leptons, an imbalance in transverse momentum, and two or three jets where 
at least one jet is consistent with originating from the decay of a b hadron. We find good agreement 
between data and background predictions. We place 95% confidence level upper limits on the 
production cross section for several Higgs boson masses ranging from 110 GeV/c 2 to 150 GeV/c 2 . 
For a mass of 115 GeV/c 2 the observed (expected) limit is 6.9 (5.6) times the standard model 
prediction. 

PACS numbers: 14.80.Bn,13.85.Rm 



'Deceased 



tWith visitors from "University of Massachusetts Amherst, 



4 



The Higgs boson is the last particle of the standard 
model (SM) of particle physics that remains to be dis- 
covered. The existence of the Higgs boson is expected to 
be the direct physical manifestation of the mechanism 
that provides mass to fundamental particles pQ. Ex- 
pectations from electroweak data collected at the Teva- 
tron, LEP, and SLD indirectly constrain the Higgs bo- 
son mass to ma < 157 GeV/c 2 at 95% confidence 
level (C.L.) [2J. Direct searches at LEP have excluded 
m H < H4.4 GeV/c 2 at 95% C.L. 0. Upper limits on 
the production cross section from searches in the region 
110 < tti < 135 GeV/c 2 remain well above the stan- 
dard model prediction [2J, and greatly benefit from im- 
provements of the experimental sensitivity. In this mass 
region, bb is the main decay mode. The b quarks fragment 
into jets of hadrons, and the signal can be reconstructed 
as a resonance in the invariant mass distribution of the 
two jets. Large multi-jet backgrounds can be reduced 
by searching for a Higgs boson (H) with an associated 
vector boson V (V — W, Z). 

This Letter presents a search for the standard model 
VH associated production in events with 6-quark jets and 
large missing transverse energy with data corresponding 
to an integrated luminosity of 2.1 fb _1 . This analysis sig- 
nificantly increases the acceptance for signal with respect 
to previous Tevatron searches [3J [5] and introduces ad- 
vanced analysis methods. We consider ZH production, 
where Z — > vv and the neutrinos (v) escape detection, 
or Z — > ii when both charged leptons (£) are undetected 
or are identified as jets. For WH production we are sen- 
sitive to events where W — > ev or W — > tv when the 
charged lepton is identified as a jet, and W — > Iv when I 
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is undetected. The WH events accepted by this analysis 
contain 50% W ->■ tv, 30% W -> iiv and 20% W ev. 
Critical challenges for this analysis are to achieve a high 
signal-to-background (S/B) ratio and to model the multi- 
jet background production accurately. We employ artifi- 
cial neural networks (ANNs) [3] to improve the event se- 
lection and signal discrimination and implement a novel 
data-driven determination of the multi-jet background. 

CDF II is a multipurpose detector that is described 
in detail elsewhere [8]. Jets are reconstructed from 
energy depositions in the calorimeter towers using a jet 
clustering cone algorithm [5] with a cone size of radius 
AR = ^{A4>) 2 + (Ari) 2 = 0.4. In addition to standard 
jet energy corrections [5], we further correct using mo- 
mentum measurements provided by the tracker, using a 
method similar to that described in [TO]- The more pre- 
cise measurement of the jet energies improves the candi- 
date Higgs boson mass resolution by s» 10% and increases 
the signal acceptance by w 10%. Both the magnitude and 
direction of are recomputed after the jet energies are 
corrected. 

The events used in this search are selected by a three- 
level trigger system that selects events with ^ and two 
jet clusters. After offline reconstruction, the event selec- 
tion requires fir > 50 GeV, and the transverse energies 
E^ 1 and E^ 2 of the two jets with the highest transverse 
energy, J\ and J2 ("leading jets"), satisfy the conditions 
E^ 1 > 35 GeV and E^ 2 > 25 GeV. We consider an event 
to have three jets if the E T of the third leading jet, J 3 , is 
greater than 15 GeV. Events with four or more jets with 
Et > 15 GeV and \rj\ < 2.4 are rejected. Events passing 
these criteria are denoted as the "pre-tagged" sample. 
After these selections, the expected S/B ratio is around 
1/20 000. 

We veto events with at least one pr > 10 GeV/c iso- 
lated electron or muon [TT] , deliberately using fairly loose 
identification criteria. These selections ensure that the 
sample used in this analysis is statistically independent 
from the one utilized in the search for WH in a final state 
containing an identified charged lepton |12j . 

Large backgrounds originating from light-flavor jet 
production can be reduced by identifying b jets in the 
candidate events. Due to their relatively long lifetime, b 
and c hadrons can travel a few millimeters from the pri- 
mary vertex before decaying into lighter hadrons. Jets 
originating from a b quark can be identified ("tagged") 
by the SECVTX algorithm [13], which reconstructs ver- 
tices that are significantly displaced from the pp interac- 
tion point, and the jetprob algorithm [TJ], which clas- 
sifies jets using the probability that tracks within the jet 
are consistent with originating from the primary vertex. 
To enhance the expected signal significance we subdivide 
the sample into three independent categories: events with 
two jets tagged by secvtx (SV+SV), events with one jet 
tagged by secvtx and another by jetprob (SV+JP) 
and events with only one jet tagged by secvtx (SV). 
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TABLE I: Comparison of the total number of expected and 
observed events in the signal region for different ^-tagging cat- 
egories. The uncertainties contain both MC statistical error 
and systematic uncertainties. 



Process 



SV+SV or SV+JP 



SV 



Multi-jet 
Single Top 
Top Pair 
Di-boson 
W + h.f. 
Z + h.f. 



120.1 ±21.3 
15.7 ±3.0 
54.5 ±7.9 
9.2 ± 1.8 

32.0 ± 14.7 

22.1 ± 11.5 



941.2 ±86.0 
43.2 ±7.9 

124.5 ± 17.0 
35.6 ±6.8 

296.9 ± 129.5 

107.0 ±45.8 



Total Prediction 
Observed 



254 ± 39 
253 



1548.4 ± 168.1 
1443 



Expected Signal for mH = 115 GeV/c 2 



ZH -> vvbb 
WH (l)vbb 
ZH -> (U)bb 



1.8 ±0.2 
1.6 ±0.2 
0.07 ±0.01 



2.1 ±0.2 
1.8 ±0.2 
0.09 ±0.01 



FIG. 1: The distribution of ANNm.t for double-tagged events. 



The selected sample is dominated by background from 
the production of multi-jet (MJ), top quark (pair and 
electroweak production), W or Z plus jets, and WW, 
WZ or ZZ events. Significant in multi-jet events 
appears when b quarks decay semileptonically or when 
jet energies are mismeasured. In both cases ^Jt is often 

aligned with E^ 2 . Therefore, events with A0(^ T , E^?) < 
0.4, no identified leptons, and 50 < < 70GeV are 
used to measure the rates with which the heavy-flavor 
jets (h.f., originating from a b or c quark) from multi-jet 
production and light flavor jets mistakenly identified as 
b jets ("mistags") are tagged. The tagging rate (TR) is 
parametrized as a function of i/r [H] of the event and 
Et, r], and ( of the jet. The observable ( is defined 
as C = cEi PT.txack/^T where p^track includes tracks 
within a jet with a significant impact parameter and 



0.5 < p l T 



track 



< 200 GeV/c. Jets originating from b 



quarks are expected to have a large £. The multi-jet 
background in the single (double) tagged sample is de- 
termined by the probability to tag one (two) jet(s) from 
the pre-tagged (single-tagged) sample JTJ [T3] , after sub- 
tracting all Monte Carlo (MC) simulated contributions. 
The validity of the tagging rate modeling is verified in 
various control regions, which are defined below. The 
remaining backgrounds are estimated using PYTHIA [16] 
simulations, and single top production is simulated with 
MADEVENT [17]. The signal MC samples are generated 
with pythia. The normalizations of the MC samples are 
described in [TT] . 

We start the selection of the signal region by re- 
quiring no identified charged leptons, > 50GeV, 

A(j)(f T7 E^) > 1.5, and A<j)(^ T , E^ 3 ) > 0.4. These se- 
lection criteria remove ss 10% of the signal in the pretag 
sample while reducing the backgrounds by approximately 
an order of magnitude. We employ an ANN, denoted 



as ANNmj, using kinematic variables to separate sig- 
nal from multi-jet background. To discriminate against 
events with due to mismeasurements in the calorime- 
ter, we use the momentum imbalance in the tracker, 
|S]. The magnitude of fa, /t\ the angle between 
them, the azimuthal angles between fa, and the jet 
directions, and several other less discriminating variables 
are used as inputs to ANNmj [II]- The distribution of 
ANNmj, shown in Fig. [I] peaks at ±1 for the signal and 
at —1 for the backgrounds that are due to mismeasured 
jets. Selecting events with ANNmj > rejects over 50% 
of the total background and retains 95% of the signal, 
yielding an S/B ratio of « 1/250, which is similar to the 
one obtained in the WH search 12 1. This region is de- 
fined as the signal region and is analyzed for the presence 
of the Higgs boson signal. 

In order to avoid potential bias in the search, we test 
our understanding of the SM background in several con- 
trol regions where the amount of signal events is negli- 
gible. The control region called EWK, sensitive to elec- 
troweak processes and top production, contains events 
with at least one lepton and Acf>(fa, E^ 2 ) > 0.4. We also 
define several control regions dominated by multi-jet pro- 
cesses where we have no identified leptons. The region 
denoted as MJ1 contains events with Acf>{fa, E^ 2 ) < 0.4 
and fa > 70 GeV. The region denoted as MJ2 contains 



events with A<f>(fa,E^) > 1.5 



A(j)(fi T ,E T 



> 0.4, 



and ANN M j < —0.5. The predictions of the multi-jet 
background are tested in MJ1 and MJ2. The distribu- 
tions of kinematic variables, such as the invariant mass of 
the two leading jets m( Ji, J2), have been found to be in 
agreement with observations in the control regions [TT] . 

To achieve a greater separation between signal and 
background we deploy a second ANN, denoted as 
ANN sig , for discriminating the remaining backgrounds 
from the expected signal. Six input variables are used 
in ANN s i g : the invariant mass of the two leading jets 
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m(J,J 2 ) (GeV/c 2 ) 



FIG. 2: Dijet invariant mass distribution for double-tagged 
events in signal region. 

m(Ji, J 2) (Fig.pl, the invariant mass of the and all 
jets, H T —$t, |*t trackmet [18] and the maxi- 

mum AR(E^' , E^ k ). The variable trackmet is the out- 
put of a ANN developed using tracking information to 
enhance the separation of events with real i&r- The most 
discriminating variable of the ANN s i g is m(Ji, J2). 

The distribution of ANN S j g is shown in Fig. [3] for 
single- and double-tagged events. The number of sig- 
nal and background events after the final selection are 
shown in Table [I] Since no significant excess is observed, 
we compute 95% C.L. upper limits for the Higgs bo- 
son production cross section times the branching fraction 
B(H — > 66). For m H =\lb GeV/c 2 we expect a total of 
4.0 (3.5) signal events with one (two) 6-tagged jets [15] . 

We analyze the binned ANN s i g discriminant distribu- 
tion to test for a WH or ZH signal in the presence of 
SM backgrounds. The systematic uncertainties included 
in the calculation are classified as correlated (uncorre- 
cted) depending on if they do (do not) affect both signal 
and the background processes [TTJ [TS] . The uncorrelated 
systematic uncertainties are the multi-jet normalization 
(between 5.5% and 20.6%) and MC statistical fluctua- 
tions. Additionally we assign the following uncertainties 
due to cross sections: 15.9% and 15.2% to single top in s- 
and i-channels, 8.6% to top pair, 11.5% to diboson and 
40% to W + h.f. and Z + hi. The shapes obtained by 
varying the TR probabilities by ±la are applied as sys- 
tematic uncertainties to each bin of ANN s ; g . The corre- 
lated systematic uncertainties are the following: luminos- 
ity measurement (6.0%), 6-tagging efficiency in MC (be- 
tween 4.3% and 12.4%), trigger efficiency (<3%), lepton 
veto efficiency (2%), parton distribution function uncer- 
tainty (2%) and 3.8-13.0% for jet energy scale (JES) [5]. 
We also assign systematic uncertainties on the shape of 
ANN s i g due to JES and trigger efficiency uncertainties. 




FIG. 3: The distribution of ANN sig for (a) single- and (b) 
double-tagged events. 

Initial- and final-state-radiation systematic uncertainties 
(between 1% and 5%) are applied to the signal predic- 
tions. 

Including all the uncertainties, the expected and ob- 
served upper limits at the 95% C.L. on VH production 
cross section times branching fraction Br(H — > 66) are 
shown in Table [II] Expected limits are obtained by gener- 
ating pseudo-experiments from the expected SM ANN s i g 
shapes to calculate the median ZH and WH contribu- 
tion which could be excluded at the 95% C.L. in the zero 
signal hypothesis. The limits are computed using the 
Bayesian likelihood method [20 with a flat prior proba- 
bility for the signal cross section and Gaussian priors for 
the uncertainties on acceptance and backgrounds. We 
combine the search channels SV+SV, SV+JP, and SV 
by taking the product of their likelihoods and simultane- 
ously varying the correlated uncertainties. The observed 
limits agree with the expected ones. 

In summary, we have performed a direct search for the 
SM Higgs boson decaying into 6-jet pairs using data with 
integrated luminosity of 2.1fb _1 accumulated in Run II 
by the CDF II detector. The combination of all improve- 
ments described above increases the sensitivity of this 
search by a factor of two with respect to 0] , and by 30% 
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TABLE II: Expected and observed 95% C.L. upper limits, 
with ratios to SM cross section. 



(GeV/c 2 ) 


Expected 
(Pb) 


Observed 
(Pb) 


rteltlO 

Expected 


riatio 
Observed 


110 
115 
120 
130 
140 
150 


1.3 
1.2 
1.2 
1.0 
0.9 
0.8 


1.5 
1.5 
1.5 
1.4 
1.0 
1.0 


4.9l?:i 

7 o+2.9 
' - z -2.1 

10.3±« 
18.6±™ 

43.3±H:S 


5.8 
6.9 
8.9 
14.4 
21.0 
49.8 



with respect to [5] once the difference in luminosity is 
accounted for. 
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